ON PROXIMITY MEASURES FOR GRAPH VERTICES



P. Yu. Chebotarev and E. V. Shamis UDC 519.173:512.643.8 

We study the properties of several proximity measures for the vertices of weighted multigraphs and 
multidigraphs. Unlike the classical distance for the vertices of connected graphs, these proximity 
measures are applicable to weighted structures and take into account not only the shortest, but also 
all other connections, which is desirable in many applications. To apply these proximity measures to
unweighted structures, every edge should be assigned the same weight, which determines the relative
contribution of two routes, one of which is one edge longer than the other. A topological
interpretation is obtained for the Moore-Penrose generalized inverse of the Laplacian matrix of a 
weighted multigraph. 



1. INTRODUCTION 

Proximity measures for the vertices of directed and undirected graphs arise in many applied settings. The 
range of applications of such functions is rather wide, including chemistry [1-7], crystallography [8], epidemiology [9], 
urban planning [10], organizational management [11], political sciences [12], aggregation of preferences [13, 14], etc. 
The most sustained interest in them is found in mathematical sociology [15-25] in connection with the problem
of measuring centrality in social networks. This important concept is multifaceted, and a great variety of model and
heuristic approaches have been proposed to define its numerical representation. Note that graph theorists have mainly dealt
with the classical distance between the vertices of a connected graph [26], which is the length of the shortest path
between them. At the same time, the presence of additional, even longer paths is of practical importance in many 
applications. For example, if the shortest road between two places is congested, a portion of goods can be delivered 
by a longer path (detour). 

In this paper, we study the properties of several "sensitive" proximity measures that take into account all 
connections in a multigraph. Their common feature is the calculation (with appropriate weights) of all structures of 
a certain type that connect two vertices: paths, routes, routes with drains, trees, and so forth. For these measures, 
the weights of edges determine the proportion of taking account of longer paths in comparison with shorter ones. 
In some cases, the weight of an edge has the meaning of a "transfer factor" that specifies the losses (of substance, 
influence, reliability, etc.) when moving through a graph. 

2. SOME NORMATIVE PROPERTIES OF PROXIMITY MEASURES 

Suppose that $G$ is a weighted multigraph with vertex set $V(G) = \{1, \ldots, n\}$ and edge set $E(G)$; $\Gamma$ is a
weighted multidigraph with vertex set $V(\Gamma) = \{1, \ldots, n\}$ and arc set $E(\Gamma)$; the weights of edges and arcs
are denoted by $\varepsilon_{ij}^{p}$ (the $p$th edge/arc from $i$ to $j$) and are strictly positive. The terms "graph" and
"subgraph" will be used as generic ones (allowing multiple, weighted, and directed arcs).

Suppose that $E = (\varepsilon_{ij})$ is the matrix of total weights of edges (arcs) for all pairs of vertices:

$$\varepsilon_{ij} = \sum_{p=1}^{a_{ij}} \varepsilon_{ij}^{p},$$

where $a_{ij}$ is the number of edges (arcs) that connect $i$ to $j$. Let $H$ be a subgraph of $G$. The product of the weights
of all edges of $H$ will be termed the weight of $H$ and denoted by $\varepsilon(H)$. The weight of a directed subgraph of
$\Gamma$ is defined similarly. The weight of a subgraph without edges/arcs is set to be 1. For any nonempty set of
subgraphs $\mathcal{G}$, its weight is

$$\varepsilon(\mathcal{G}) = \sum_{H \in \mathcal{G}} \varepsilon(H). \tag{1}$$

The weight of the empty set is zero. $P = (p_{ij})$ will designate various $n \times n$ matrices of proximity (accessibility,
connectedness) measures for the vertices of $G$ or $\Gamma$.

Let us formulate a number of conditions that are rather natural to require of the proximity measures under
consideration. Most of them were introduced in connection with the relative forest accessibility of graph vertices [25].

Symmetry. For every multigraph, the matrix P is symmetric. 

This condition is hardly natural as applied to directed graphs. In the statements below, symmetry always 
stands for that applied in the undirected case. 

Nonnegativity. For any multigraph (multidigraph), $p_{ij} \ge 0$, $i, j = 1, \ldots, n$.

Reversal property. For any multidigraph, the reversal of all its arcs (provided that their weights are 
preserved) results in the transposition of the proximity matrix. 

Diagonal maximality. For any multigraph (multidigraph) and any $i, j = 1, \ldots, n$ such that $i \ne j$,
$p_{ii} > p_{ij}$ and $p_{ii} > p_{ji}$ hold.

This condition requires a stronger relation of each vertex to itself than to any other vertex. If a proximity 
measure has the reversal property, then the two inequalities of the diagonal maximality are equivalent in the case of 
directed graphs as well as in the undirected case. Since all the measures applicable to directed graphs hereinafter 
possess the reversal property, we will prove only the first inequality of diagonal maximality. 

Triangle inequality for proximities. For any multigraph and for any $i, j, k = 1, \ldots, n$,
$p_{ij} + p_{ik} - p_{jk} \le p_{ii}$ holds. If, in addition, $j = k$ and $i \ne j$, then the inequality is strict.

The triangle inequality for proximities is also meaningful as applied to directed graphs. However, in this case
it requires special consideration, since different orders of subscripts ($p_{ij}$ or $p_{ji}$, etc.) give rise to several
modifications. In this paper, a "directed" triangle inequality for proximities is used in some proofs, but in the main
text we deal with its undirected version only.

Consider the index$^{1}$

$$d_{ij} = p_{ii} + p_{jj} - p_{ij} - p_{ji}, \quad i, j = 1, \ldots, n. \tag{2}$$

Metric representability of proximity. The index $d_{ij}$ is a distance between the vertices of a multigraph,
i.e., it satisfies the axioms of a metric.

This condition is always satisfied, provided that symmetry and the triangle inequality for proximities hold
true [29]; the latter condition turns out to be closely related to the usual triangle inequality for the distance $d_{ij}$.
Indeed, under symmetry, $d_{ij} + d_{jk} - d_{ik} = 2(p_{jj} + p_{ik} - p_{ji} - p_{jk})$, which is nonnegative by the triangle
inequality for proximities. Moreover, some kind of duality has been established between the metrics defined on an
arbitrary set and the functions that satisfy the triangle inequality for proximities and an additional normalization
condition [29].

Let us give an example not dealing with graphs to illustrate the triangle inequality for proximities and
the metric (2). Let $p(x, y)$ be the function, defined on pairs of sets from some family $X$ of finite sets, that takes
every pair of sets $(x, y)$ to the number $|x \cap y|$ of elements in their intersection. Then, for any $x, y, z \in X$,

$$p(x,x) = |x| \ge |x \cap y| + |x \cap z| - |x \cap y \cap z| \ge |x \cap y| + |x \cap z| - |y \cap z| = p(x,y) + p(x,z) - p(y,z), \tag{3}$$

i.e., the triangle inequality for proximities is fulfilled (for $y = z$ and $x \ne y$ the inequality is strict, since then
$p(x,x) - 2p(x,y) + p(y,y) = |x \,\triangle\, y| > 0$). The transformation (2) applied to $p(x, y)$ generates the usual
metric on finite sets: the distance between $x$ and $y$ is the number of elements in their symmetric difference.

$^{1}$ Transformations of the form (2) appear, either explicitly or implicitly, in many papers, e.g., [3, 6, 7, 9, 19, 25, 27, 28],
and also in the theory of linear statistical models.
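The set example can be checked mechanically. The following is a minimal sketch, assuming a small hypothetical family of sets chosen only for illustration; the function names `proximity`, `distance`, and `check_family` are not from the paper.

```python
from itertools import product

def proximity(x, y):
    """p(x, y) = |x ∩ y| for finite sets x and y."""
    return len(x & y)

def distance(x, y):
    """Index (2): d = p(x,x) + p(y,y) - p(x,y) - p(y,x)."""
    return proximity(x, x) + proximity(y, y) - proximity(x, y) - proximity(y, x)

# Hypothetical family of finite sets, chosen only for illustration.
family = [frozenset(s) for s in ({1, 2}, {2, 3, 4}, {1, 4}, set(), {1, 2, 3, 4})]

def check_family(sets):
    """Return True if (3) and the metric properties hold on all triples."""
    for x, y, z in product(sets, repeat=3):
        # Triangle inequality for proximities, as in (3).
        if proximity(x, y) + proximity(x, z) - proximity(y, z) > proximity(x, x):
            return False
        # Transformation (2) yields the size of the symmetric difference.
        if distance(x, y) != len(x ^ y):
            return False
        # Ordinary triangle inequality for the induced distance.
        if distance(x, z) > distance(x, y) + distance(y, z):
            return False
    return True

print(check_family(family))
```

The check is exhaustive only over the chosen family; it illustrates the inequalities rather than proving them.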

In the sequel, we assume that there is one path of length 0 from any vertex to itself.

Disconnection condition. For any multigraph $G$ (multidigraph $\Gamma$) and for any $i, j = 1, \ldots, n$, $p_{ij} = 0$ iff
there is no path from $i$ to $j$ in $G$ (in $\Gamma$).

Connectivity condition (a consequence of the disconnection condition). 

(1) For any multigraph, the matrix P can be reduced to a block-diagonal form, where all block entries are 
strictly positive, all other entries being zero. The matrix P is strictly positive iff G is connected. 

(2) For any $i, j, k \in V(G)$, $p_{ij} > 0$ and $p_{jk} > 0$ imply $p_{ik} > 0$.

The following normative property can be considered as an extension of diagonal maximality. 

Transit property. For any multigraph $G$ and any $i, k, t \in V(G)$, if $G$ contains a path from $i$ to $k$,
$i \ne k \ne t$, and each path from $i$ to $t$ includes $k$, then $p_{ik} > p_{it}$. The same applies to multidigraphs.

Monotonicity. Suppose that the weight $\varepsilon_{kt}^{p}$ of some edge (arc) in a multigraph $G$ (multidigraph $\Gamma$)
increases or a new edge (arc) from $k$ to $t$ appears. Then

(1) $\Delta p_{kt} > 0$, and for any $i, j = 1, \ldots, n$, $\{i, j\} \ne \{k, t\}$ implies $\Delta p_{kt} > \Delta p_{ij}$; in the
directed case, the hypothesis is weakened to [$i \ne k$ or $j \ne t$];

(2) for any $i = 1, \ldots, n$, if there is a path from $i$ to $k$, and each path from $i$ to $t$ includes $k$, then
$\Delta p_{it} > \Delta p_{ik}$;

(3) for any $i_1, i_2 = 1, \ldots, n$, if $i_1$ and $i_2$ can be substituted for $i$ in the hypothesis of item 2, then
$p_{i_1 i_2}$ does not increase.

Item 3 can be interpreted as follows: the proximity between two vertices does not increase whenever the 
bond that appears or becomes stronger is extraneous for the connection of these two vertices. 

3. PATH ACCESSIBILITY 

The simplest proximity measure that takes into account not only the shortest path between vertices is path
accessibility. The path accessibility of $j$ from $i$ is defined as the total weight of all paths from $i$ to $j$. There are two
ways of defining this measure at $j = i$. First, "paths from $i$ to $i$" can be interpreted as simple cycles from $i$ to $i$ plus
the path of length 0 whose weight is unity. The second possibility is to assume that the latter trivial path is the only
path from $i$ to $i$. Note that discarding this trivial path leaves no chance of meeting diagonal maximality. We adopt
the first definition, which is more informative, though more disputable, but the subsequent discussion is applicable
to the second definition too.

Path accessibility can serve as a proximity measure only if a shorter path is assigned a greater weight than
a covering longer path (cf. transit property). If the weight of a path is the product of the weights of the constituent
edges/arcs (as we assume hereinafter), this requires that the edge/arc weights belong to the interval [0, 1]. In this
way, path accessibility (as well as the subsequent indices) corresponds to the models where every edge weight is a
"transfer factor" that determines the weakening of "vertex influence" with movement away from the vertex along
the edge. In some cases, such a model can be applicable to transformed data that result after multiplying each edge
(arc) weight by a constant factor $\tau$, $0 < \tau < (\max_{i,j,p} \varepsilon_{ij}^{p})^{-1}$. With the same effect, the weight of a
path can be defined as $\prod (\tau\,\varepsilon(e))$, with the product over all edges (arcs) $e$ in the path. In the same manner,
each edge/arc of an unweighted graph can be assigned the same weight $\tau$. While talking about edge/arc weights, we
will have in mind the weights so obtained too.

To choose $\tau$ for unweighted graphs, one has to estimate the proximity of two vertices connected by an edge
compared to the proximity of two vertices connected by a two-edge path. If the latter vertices appear to be two times
"farther," then $\tau = 1/2$ can be chosen. In this case, two vertices connected by a three-edge (four-edge) path are
four times (respectively, eight times) farther. If the respective decrements of 3 and 4 seem to be more natural, one
has to take another model, which the reader can easily construct. Here, the reciprocal weight of a path is the sum
of the reciprocal weights of the constituent edges (harmonic rather than geometric decrease). The original concept
in such models is distance, whereas proximity can be introduced as the reciprocal value. Undoubtedly, these models
are natural, but we do not consider them in this paper. Some of their properties are discussed in [9, 30].

Let P be the matrix whose entries are the values of path accessibility for all pairs of vertices. 
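As an illustration of the definition, here is a minimal sketch that sums the weights of all simple paths by depth-first search. It assumes an undirected multigraph given as a list of `(u, v, weight)` triples and covers only the off-diagonal entries ($j \ne i$); the function name and data layout are hypothetical, not taken from the paper.

```python
from fractions import Fraction

def path_accessibility(n, edges, i, j):
    """Total weight of all simple paths from i to j (i != j).

    n: number of vertices, labelled 1..n.
    edges: list of (u, v, weight); parallel edges are listed separately.
    """
    if i == j:
        raise ValueError("this sketch covers off-diagonal entries only")
    adj = {v: [] for v in range(1, n + 1)}
    for u, v, w in edges:
        adj[u].append((v, w))
        adj[v].append((u, w))

    def dfs(v, visited, weight):
        # weight is the product of edge weights along the path built so far
        if v == j:
            return weight
        total = Fraction(0)
        for nxt, w in adj[v]:
            if nxt not in visited:
                total += dfs(nxt, visited | {nxt}, weight * w)
        return total

    return dfs(i, {i}, Fraction(1))

# Unweighted chain 1 - 2 - 3 - 4 with every edge assigned tau = 1/2.
tau = Fraction(1, 2)
chain = [(1, 2, tau), (2, 3, tau), (3, 4, tau)]
print([path_accessibility(4, chain, 1, j) for j in (2, 3, 4)])
```

On the chain, each extra edge halves the value, which is the geometric decrease described above for $\tau = 1/2$. Enumerating simple paths takes exponential time in general, so this is suitable for small examples only.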

Proposition 1. Path accessibility has the following properties: symmetry, nonnegativity, reversal property,
and disconnection condition. Moreover, if $\varepsilon_{ij}^{p} < \varepsilon$ for all $i, j = 1, \ldots, n$ and $p \le a_{ij}$
(where $\varepsilon$ is a specific constant dependent on $n$ and the greatest possible number $m$ of multiple edges/arcs),
then diagonal maximality, the triangle inequality for proximities, the transit property, and monotonicity are true.

The proofs of all statements are given in the Appendix.

Since the changes in proximities under special modifications of graphs are of interest, restrictions on the 
edge/arc weights are introduced for certain families of graphs rather than for individual graphs. Here, such a family 
is determined by n and m. 



4. CONNECTION RELIABILITY AS A VERTEX PROXIMITY MEASURE 



Let us assume that all edge/arc weights belong to the interval [0, 1], and consider them as the probabilities
of edge/arc intactness. Define $p_{ij}$ to be the reliability of connections between $i$ and $j$, i.e., the probability that at
least one intact path between $i$ and $j$ survives, provided that all edge/arc failures are independent; let $P = (p_{ij})$ be
the matrix of connection reliabilities for all pairs of vertices. Connection reliability can be considered as a proximity
measure for graph vertices. Let us point out some advantages of this measure. First, it is based upon a natural
model. Second, it is not always appropriate that the proximity be doubled when all paths between a pair of vertices are
duplicated (this is the case when path accessibility is used); in some cases, the increase should be more moderate.
Connection reliability has this property.

According to a well-known theorem (see, e.g., [31, p. 10]),

$$p_{ij}(G) = \sum_{k} \Pr(R_k) - \sum_{k<t} \Pr(R_k R_t) + \sum_{k<t<l} \Pr(R_k R_t R_l) - \ldots + (-1)^{h+1} \Pr(R_1 R_2 \cdots R_h), \tag{4}$$

where $R_1, R_2, \ldots, R_h$ are all paths between $i$ and $j$; $\Pr(R_k R_t) = \varepsilon(R_k \cup R_t)$, where $R_k \cup R_t$ is the
subgraph that contains those edges (arcs) that belong to $R_k$ or $R_t$, and so forth. By virtue of (4), connection
reliability is a natural modification of path accessibility that takes into account the degree of overlapping for
different paths between two vertices.
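The inclusion-exclusion formula (4) can be compared with a direct computation over all edge states. The sketch below assumes an undirected multigraph given as `(u, v, weight)` triples with vertices labelled 1..n; all function names are hypothetical, and both routines take exponential time, so they are meant for small examples only.

```python
from fractions import Fraction
from itertools import combinations, product

def simple_paths(n, edges, i, j):
    """Edge-index sets of all simple paths between i and j."""
    adj = {v: [] for v in range(1, n + 1)}
    for idx, (u, v, _) in enumerate(edges):
        adj[u].append((v, idx))
        adj[v].append((u, idx))
    found = []

    def dfs(v, visited, used):
        if v == j:
            found.append(frozenset(used))
            return
        for nxt, idx in adj[v]:
            if nxt not in visited:
                dfs(nxt, visited | {nxt}, used + [idx])

    dfs(i, {i}, [])
    return found

def reliability_incl_excl(n, edges, i, j):
    """Right-hand side of (4): signed sum over unions of paths."""
    paths = simple_paths(n, edges, i, j)
    total = Fraction(0)
    for size in range(1, len(paths) + 1):
        sign = 1 if size % 2 == 1 else -1
        for subset in combinations(paths, size):
            union = frozenset().union(*subset)
            weight = Fraction(1)
            for idx in union:
                weight *= edges[idx][2]  # each shared edge is counted once
            total += sign * weight
    return total

def reliability_brute_force(n, edges, i, j):
    """Probability that i and j are joined by intact edges, over all 2^|E| states."""
    total = Fraction(0)
    for state in product([False, True], repeat=len(edges)):
        prob = Fraction(1)
        for alive, (_, _, w) in zip(state, edges):
            prob *= w if alive else 1 - w
        reached, frontier = {i}, [i]
        while frontier:  # flood fill along intact edges
            v = frontier.pop()
            for alive, (a, b, _) in zip(state, edges):
                if alive and v in (a, b):
                    other = b if v == a else a
                    if other not in reached:
                        reached.add(other)
                        frontier.append(other)
        if j in reached:
            total += prob
    return total

half = Fraction(1, 2)
doubled = [(1, 2, half), (1, 2, half)]  # one edge duplicated
print(reliability_incl_excl(2, doubled, 1, 2), reliability_brute_force(2, doubled, 1, 2))
```

In the duplicated-edge example the reliability rises from 1/2 to 3/4, whereas the total path weight would double to 1; this is the more moderate increase mentioned above.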

Connection reliability possesses all the normative properties listed in Sec. 2, though for some of them the
strict inequality $\varepsilon_{ij}^{p} < 1$ is necessary.

Proposition 2. Connection reliability has the following properties: symmetry, nonnegativity, reversal 
property, disconnection condition, and item 3 of monotonicity. Diagonal maximality, the triangle inequality for 
proximities, the transit property, and items 1 and 2 of monotonicity hold true, provided that the intactness probability 
of each edge/arc is strictly less than 1; otherwise they are satisfied in a nonstrict form. 



5. ROUTE ACCESSIBILITY 



A special feature of path accessibility (which also applies to connection reliability) is the necessity of a logical 
algorithm for its calculation. The replacement of paths by routes reduces the problem to the inversion of a matrix 
(see, e.g., [8]). Moreover, the route accessibility of j from i has some relation to the following problem: find the 
probability that a random walk started at i is located at j at a "randomly chosen" moment. Note that the proximity 
measures originating from the analysis of Markov chains require special consideration. Interesting information on 
them can be found in [7, 9, 20, 32]. 

Consider the matrix $P = (I - E)^{-1}$, where $E = (\varepsilon_{ij})$ is the matrix of total weights of edges (arcs) introduced
above. Expand $P$ as the sum of an infinitely decreasing geometric progression (not specifying the conditions of its
validity so far):

$$P = (I - E)^{-1} = I + E + E^{2} + \ldots. \tag{5}$$

Let $\mathcal{R}_{ij}$ be the set of routes from $i$ to $j$. Since the entries of $E^{k}$ are the total weights of $k$-length routes,
(5) implies

$$p_{ij} = \varepsilon(\mathcal{R}_{ij}), \quad i, j = 1, \ldots, n, \tag{6}$$

i.e., $p_{ij}$ is the total weight of routes from $i$ to $j$ (at $j = i$, the route of length 0 weighted by 1 is naturally taken
into account). Therefore, $P$ is the matrix of route accessibilities in a multigraph (multidigraph).

Equation (5) is valid if and only if

$$|\lambda_1| < 1, \tag{7}$$

where $|\lambda_1|$ is the spectral radius of $E$ [33, Corollary 5.6.16].

Consider the upper bound for $|\lambda_1|$ provided by the Gershgorin theorem (see [33]):

$$|\lambda_1| \le \max_{i} \sum_{j=1}^{n} |\varepsilon_{ij}|. \tag{8}$$

Let $\varepsilon_{\max}$ be an imposed upper bound for the edge/arc weights; suppose that $m$ is the greatest possible
number of multiple edges (arcs) incident to the same pair of vertices. Then

$$|\lambda_1| \le \max_{i} \sum_{j=1}^{n} |\varepsilon_{ij}| \le m(n-1)\,\varepsilon_{\max}. \tag{9}$$

Therefore, the validity of (7) (and thus of (5)) is provided by

$$\varepsilon_{\max} < (m(n-1))^{-1}. \tag{10}$$

While on the subject of route accessibility, we will assume that the constraint (10) is satisfied (possibly, 
after the transformation of edge/arc weights mentioned in Sec. 3). A representation of the entries of P through the 
weights of specific connections in a digraph (this representation involves finite sums only and thus does not require 
any restrictions on the edge/arc weights) can be found in [34]. A useful review of results related to the calculation 
of routes in graphs is given in [35] . 
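The relation between the inverse in (5) and the route series can be illustrated on a small example. This is a minimal sketch using exact rational arithmetic from the standard library; the helper names (`inverse`, `route_accessibility`, `neumann_partial_sum`) and the example graph are assumptions made for illustration.

```python
from fractions import Fraction

def inverse(M):
    """Gauss-Jordan inverse over exact fractions; raises StopIteration if M is singular."""
    n = len(M)
    aug = [[Fraction(x) for x in row] + [Fraction(int(r == c)) for c in range(n)]
           for r, row in enumerate(M)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        piv = aug[col][col]
        aug[col] = [x / piv for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def mat_mul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def total_weight_matrix(n, edges):
    """Matrix E of total edge weights for an undirected multigraph on vertices 0..n-1."""
    E = [[Fraction(0)] * n for _ in range(n)]
    for u, v, w in edges:
        E[u][v] += w
        if u != v:
            E[v][u] += w
    return E

def route_accessibility(E):
    """P = (I - E)^{-1}."""
    n = len(E)
    return inverse([[int(i == j) - E[i][j] for j in range(n)] for i in range(n)])

def neumann_partial_sum(E, terms):
    """I + E + ... + E^(terms-1), the truncated series (5)."""
    n = len(E)
    power = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    total = [[Fraction(0)] * n for _ in range(n)]
    for _ in range(terms):
        total = [[t + p for t, p in zip(tr, pr)] for tr, pr in zip(total, power)]
        power = mat_mul(power, E)
    return total

# Hypothetical example: chain 0 - 1 - 2 with every weight 1/4.
# Here n = 3 and m = 1, so bound (10) requires weights below 1/2.
a = Fraction(1, 4)
E = total_weight_matrix(3, [(0, 1, a), (1, 2, a)])
P = route_accessibility(E)
approx = neumann_partial_sum(E, 40)
gap = max(abs(P[i][j] - approx[i][j]) for i in range(3) for j in range(3))
print(P[0], float(gap))
```

Condition (10) is sufficient but not necessary for convergence; the series converges whenever the spectral radius of $E$ is below 1.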

Proposition 3. Route accessibility has the following properties: symmetry, nonnegativity, reversal
property, diagonal maximality, the triangle inequality for proximities (for the edge/arc weights not exceeding
$(mn)^{-1}$), the disconnection condition, the transit property, and items 1 and 2 of monotonicity. Item 3 of
monotonicity is not valid for it.

The triangle inequality has not yet been proved in the general case. The following proposition is used in the 
proofs of other properties and is worth mentioning in itself. 

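Proposition 4 (on one-step increment of route accessibility for multidigraphs). Suppose that some arc
weight $\varepsilon_{kt}^{p}$ in $\Gamma$ increases by $\Delta\varepsilon_{kt} > 0$ or an extra arc from $k$ to $t$ with a weight
$\Delta\varepsilon_{kt}$ is added to $\Gamma$. Let $\Gamma'$ be the new multidigraph and $P' = P(\Gamma')$. Then

$$\Delta P = hR,$$

where $\Delta P = P' - P$, $h = \dfrac{\Delta\varepsilon_{kt}}{1 - \Delta\varepsilon_{kt}\, p_{tk}}$, and $R = (r_{ij})$ is the
$n \times n$ matrix with entries $r_{ij} = p_{ik}\, p_{tj}$.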
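One way to see why a rank-one formula of this kind arises is the Sherman-Morrison identity. The derivation below is only a sketch under that assumption and is not necessarily the argument of the Appendix; $e_k$ and $e_t$ denote standard unit column vectors, a notation introduced here for illustration, and the denominator is assumed to be nonzero.

```latex
% Sketch only: Sherman-Morrison is a swapped-in tool, not necessarily the authors' proof.
% e_k, e_t are the k-th and t-th standard unit column vectors (notation assumed here).
\begin{align*}
E' &= E + \Delta\varepsilon_{kt}\, e_k e_t^{\top}, \\
P' &= \bigl((I - E) - \Delta\varepsilon_{kt}\, e_k e_t^{\top}\bigr)^{-1}
    = P + \frac{\Delta\varepsilon_{kt}\, P e_k e_t^{\top} P}
               {1 - \Delta\varepsilon_{kt}\, e_t^{\top} P e_k}, \\
(P e_k e_t^{\top} P)_{ij} &= p_{ik}\, p_{tj}, \qquad e_t^{\top} P e_k = p_{tk}, \\
\Delta P &= \frac{\Delta\varepsilon_{kt}}{1 - \Delta\varepsilon_{kt}\, p_{tk}}\, R,
\qquad r_{ij} = p_{ik}\, p_{tj}.
\end{align*}
```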



6. RELATIVE FOREST ACCESSIBILITY FOR MULTIGRAPHS 



The notion of relative forest accessibility for multigraphs and multidigraphs was introduced in [25, 36], 
where we studied its properties in the case of multigraphs. In the present paper, we consider the undirected case too. 
Relative forest accessibility for multidigraphs is not one, but two complementary indices, calculated by counting the 
weights of converging and diverging spanning forests, respectively. None of the two possesses the reversal property 
of Sec. 2, but they have it "together" : the matrix of the first index for the multidigraph with reversed arcs equals 
the transposed matrix of the second index for the original multidigraph, and vice versa. Some other properties are 
also natural to apply to the pair of indices. Considering both indices in this paper would therefore
complicate its structure excessively. In the next two sections, we study the limit properties of the relative
forest accessibility measure for multigraphs. The corresponding limit properties for multidigraphs are substantially 
different, and they should be considered elsewhere. 

All assertions of Proposition 5 stated below, except for item 1 of monotonicity, are proved in [25] . Item 1 of 
monotonicity is proved in the Appendix. 

Recall that the Laplacian matrix (also called the Kirchhoff or the admittance matrix) of a multigraph $G$ is
the $n \times n$ matrix $L = L(G) = (\ell_{ij})$ with entries

$$\ell_{ij} = -\sum_{p=1}^{a_{ij}} \varepsilon_{ij}^{p}, \quad j \ne i, \quad i, j = 1, \ldots, n, \tag{11}$$

$$\ell_{ii} = -\sum_{j \ne i} \ell_{ij}, \quad i = 1, \ldots, n, \tag{12}$$

where $a_{ij}$ is the number of (multiple) edges incident to $i$ and $j$ simultaneously. By (11) and (12), $\ell_{ii}$ is the total
weight of edges incident to $i$ (exclusive of loops).

The matrix

$$Q = (q_{ij}) = (I + L(G))^{-1}$$

is the matrix of relative forest accessibilities of vertices in $G$.

This term is suggested by the matrix-forest theorem [13, 25, 28, 36]. Suppose that $\mathcal{F}(G) = \mathcal{F}$ is the set of
all spanning rooted forests of multigraph $G$, and $\mathcal{F}^{ij}(G) = \mathcal{F}^{ij}$ is the set of those spanning rooted forests
in which $i$ and $j$ belong to the same tree rooted at $i$. A spanning rooted forest is an acyclic subgraph of $G$ that has the
same vertex set as $G$ and one marked vertex (a root) in each component.

THEOREM 1 (matrix-forest theorem for weighted multigraphs) [25, 36]. For any weighted multigraph $G$,
the matrix $Q = (I + L(G))^{-1}$ exists and $q_{ij} = \varepsilon(\mathcal{F}^{ij})/\varepsilon(\mathcal{F})$, $i, j = 1, \ldots, n$.

Recall that, according to (1), $\varepsilon(\mathcal{F}^{ij})$ and $\varepsilon(\mathcal{F})$ are the total weights of forests that belong to
$\mathcal{F}^{ij}$ and $\mathcal{F}$, respectively. For the sake of unification, in the sequel we denote the matrix $Q$ by $P = (p_{ij})$
(as well as other matrices of proximity measures).
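The matrix-forest theorem can be checked on a small multigraph by brute-force enumeration of spanning rooted forests. The sketch below assumes vertices labelled 0..n-1 and an edge list of `(u, v, weight)` triples; the function names and the example graph are hypothetical, and the enumeration is exponential in the number of edges.

```python
from fractions import Fraction
from itertools import combinations

def inverse(M):
    """Gauss-Jordan inverse over exact fractions; raises StopIteration if M is singular."""
    n = len(M)
    aug = [[Fraction(x) for x in row] + [Fraction(int(r == c)) for c in range(n)]
           for r, row in enumerate(M)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        piv = aug[col][col]
        aug[col] = [x / piv for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def forest_accessibility(n, edges):
    """Q = (I + L)^{-1} for an undirected weighted multigraph."""
    A = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for u, v, w in edges:
        if u != v:  # loops do not contribute to the Laplacian
            A[u][u] += w
            A[v][v] += w
            A[u][v] -= w
            A[v][u] -= w
    return inverse(A)

def component_labels(n, chosen):
    """Component label of each vertex if the chosen edges are acyclic, else None."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for u, v, _ in chosen:
        ru, rv = find(u), find(v)
        if ru == rv:
            return None  # the edge would close a cycle (this also rejects loops)
        parent[ru] = rv
    return [find(x) for x in range(n)]

def forest_ratios(n, edges):
    """Matrix of eps(F^{ij}) / eps(F), by enumerating all spanning forests."""
    total = Fraction(0)
    rooted = [[Fraction(0)] * n for _ in range(n)]
    for k in range(len(edges) + 1):
        for chosen in combinations(edges, k):
            label = component_labels(n, chosen)
            if label is None:
                continue
            weight = Fraction(1)
            for _, _, w in chosen:
                weight *= w
            sizes = {}
            for lab in label:
                sizes[lab] = sizes.get(lab, 0) + 1
            rootings = 1
            for s in sizes.values():
                rootings *= s  # each tree may be rooted at any of its vertices
            total += weight * rootings
            for i in range(n):
                # Root the tree of i at i; the remaining trees are rooted freely.
                fixed = rootings // sizes[label[i]]
                for j in range(n):
                    if label[j] == label[i]:
                        rooted[i][j] += weight * fixed
    return [[entry / total for entry in row] for row in rooted]

# Hypothetical multigraph with a double edge between vertices 0 and 1.
edges = [(0, 1, Fraction(2)), (0, 1, Fraction(1)), (1, 2, Fraction(1, 2)),
         (0, 2, Fraction(1)), (2, 3, Fraction(3))]
Q = forest_accessibility(4, edges)
print(Q == forest_ratios(4, edges))
```

Counting rootings as the product of tree sizes avoids listing rooted forests one by one: an unrooted forest with trees of sizes $s_1, \ldots, s_c$ corresponds to $s_1 \cdots s_c$ rooted forests of the same weight.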

The characteristic features of relative forest accessibility are doubly stochastic normalization (more precisely, 
its second condition) and macrovertex independence. 

Doubly stochastic normalization. For any multigraph $G$,

(1) $p_{ij} \ge 0$, $i, j = 1, \ldots, n$, and

(2) $\sum_{j=1}^{n} p_{ij} = \sum_{j=1}^{n} p_{ji} = 1$, $i = 1, \ldots, n$.

According to this condition, $p_{ij}$ can be interpreted as the share of the connectivity of $i$ and $j$ in the total
connectivity of $i$ (or $j$) with all vertices. This interpretation requires some explanation. Indeed, by virtue of symmetry,
it requires that the "total connectivity" of all vertices be identical, irrespective of the difference in their position
within a multigraph. This is realized with the aid of the diagonal entries of the matrix: if $i$ is poorly connected with
other vertices, then $p_{ii}$ (which expresses the "solitariness" of $i$) is great, and thus the "total connectivity" is the
same as for all other vertices.

Let $D$ be a subset of the vertex set $V(G)$. We say that $D$ is a macrovertex in $G$ if for every $i, j \in D$ and
$k \notin D$, $\varepsilon_{ik} = \varepsilon_{jk}$ holds.

The following property is a sufficient condition for the equality and stability of proximities.

Macrovertex independence. Suppose that $D$ is a macrovertex in $G$ and $i \in D$, $j \in D$, $k \notin D$. Then
$p_{ik} = p_{jk}$, and $p_{ik}$ does not vary when any new edges appear or the weights of any existing edges change inside $D$.

Macrovertex independence substantially strengthens the following simple condition (which is not included 
in the list of Sec. 2, since it is obviously met by all proximity measures under consideration). 

Independence of other components. Let A and B be two different components of a multigraph. Then 
any addition, removal, or reweighting of edges (arcs) within B does not alter the values of proximity for the vertices 
that belong to A. 

Proposition 5. Relative forest accessibility for multigraphs has the following properties: symmetry,
nonnegativity, diagonal maximality, the triangle inequality for proximities, the disconnection condition, the transit
property, monotonicity, doubly stochastic normalization, and macrovertex independence.

Thereby, relative forest accessibility for multigraphs possesses all normative properties of Sec. 2 without any 
restrictions on the weights of edges, and it features macrovertex independence and doubly stochastic normalization. 
Certainly, this does not raise relative forest accessibility over other proximity measures. Rather, this index perfectly 
corresponds to one possible concept of proximity specified by the properties listed in Proposition 5. 

7. COMPONENTS OF RELATIVE FOREST ACCESSIBILITY 

In this section, the relative forest accessibility for multigraphs is decomposed into components that correspond 
to the sets of forests with a varying number of trees. Next, we consider the notions of proximity that correspond 
to each component. Let $v$ be the number of connected components in $G$; by $V_i$ we denote the set of vertices of the
component of $G$ that contains vertex $i$ ($i = 1, \ldots, n$).

THEOREM 2 (a parametric version of the matrix-forest theorem for multigraphs). For any
weighted multigraph $G$ and any $\tau > 0$, let $Q(\tau) = (q_{ij}(\tau))$ be the matrix $(I + \tau L)^{-1}$. Then $Q(\tau)$ exists and

$$q_{ij}(\tau) = \sum_{k=0}^{n-v} \tau^{k}\, \varepsilon(\mathcal{F}_{k}^{ij}) \Big/ \sum_{k=0}^{n-v} \tau^{k}\, \varepsilon(\mathcal{F}_{k}), \quad i, j = 1, \ldots, n, \tag{13}$$

where $\mathcal{F}_{k}$ is the set of spanning rooted forests in $G$ that consist of $k$ edges, and $\mathcal{F}_{k}^{ij} \subseteq \mathcal{F}_{k}$
is its subset comprising those forests in which $j$ belongs to a tree rooted at $i$.

By Proposition 5, the matrix of relative forest accessibilities is doubly stochastic, whence
$\sum_{j=1}^{n} q_{ij}(\tau) = 1$, $i = 1, \ldots, n$, $\tau > 0$. The following proposition states a stronger fact, namely, the
stochastic property is true for the coefficients at every exponent of $\tau$ in (13).

Proposition 6. For any $i = 1, \ldots, n$ and $k = 0, \ldots, n - v$, we have

$$\sum_{j=1}^{n} \varepsilon(\mathcal{F}_{k}^{ij}) = \varepsilon(\mathcal{F}_{k}). \tag{14}$$

The matrices $Q(\tau)$, $\tau > 0$, make up a parametric family of relative forest accessibility indices which obviously
have the same basic properties as $Q = Q(1)$. By (13), $Q(\tau)$ can be represented as

$$Q(\tau) = \frac{1}{s(\tau)} \left( \tau^{0} Q_{0} + \tau^{1} Q_{1} + \ldots + \tau^{n-v} Q_{n-v} \right), \tag{15}$$

where $s(\tau) = \sum_{k=0}^{n-v} \tau^{k}\, \varepsilon(\mathcal{F}_{k})$, $Q_{k} = (q_{k,ij})$, and
$q_{k,ij} = \varepsilon(\mathcal{F}_{k}^{ij})$, $k = 0, \ldots, n - v$, $i, j = 1, \ldots, n$.

Every matrix $Q_{k}$, $k = 0, \ldots, n - v$, reflects a specific vertex proximity. Let us consider them in some detail.
First, $Q_{0} = I$, i.e., the "proximity" specified by $Q_{0}$ is simply identity. Further, the entry $q_{1,ij}$, $j \ne i$, of $Q_{1}$
is equal to the total weight of the edges in $G$ that are incident to $i$ and $j$. Generally, the entry $q_{k,ij}$ of $Q_{k}$ is
distinct from zero if and only if $G$ contains some paths of length $k$ or shorter between $i$ and $j$. The corresponding
notion of proximity ignores all paths of length $k + 1$ or longer. Whenever $k \ge |V_i|_{\max} - 1$ (where $|V_i|_{\max}$ is the
maximum number of vertices among the components of $G$), the proximity corresponding to $Q_{k}$ takes into account all
paths in $G$.

Recall that $V_i$ is the set of vertices in the component of $G$ that contains $i$. To examine the proximity
corresponding to $Q_{n-v}$, we introduce the matrix $\bar{J}(G) = \bar{J} = (\bar{J}_{ij})$,

$$\bar{J}_{ij} = \begin{cases} 1/|V_i| & \text{if } j \in V_i, \\ 0 & \text{otherwise}, \end{cases}$$

and prove the following lemma.

LEMMA 1.

$$Q_{n-v} = \varepsilon(\mathcal{F}_{n-v})\, \bar{J}. \tag{16}$$

As mentioned above, the "proximity" that corresponds to $Q_{0}$ is identity. By Lemma 1, the matrix $Q_{n-v}$
represents an opposite concept of proximity: all vertices that belong to the same component of $G$ are equally "close"
to each other, and the value of their proximity is inversely proportional to the number of vertices in the component.
Thus, the proximity to vertex $i$ is uniformly distributed over the component of $G$ that contains $i$. If $G$ is connected,
then $\bar{J} = (1/n) J$, where $J$ is the $n \times n$ matrix having all entries one, and so all entries of $Q_{n-v}$ are
$\varepsilon(\mathcal{F}_{n-v})/n$. For all matrices $Q_{k}$, $k = 0, \ldots, n - v$, the proximity of two vertices from different
components of $G$ is zero.

COROLLARY 1. $\lim_{\tau \to \infty} Q(\tau) = \bar{J}$.

Corollary 1 follows directly from Theorem 2 and Lemma 1. 
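As a small worked illustration (not taken from the paper), consider the multigraph with two vertices joined by a single edge of weight $w$, so that $n = 2$ and $v = 1$. The forest weights and the limit can then be written out explicitly:

```latex
% Worked example under the stated assumption: one edge of weight w, n = 2, v = 1.
\begin{align*}
L &= \begin{pmatrix} w & -w \\ -w & w \end{pmatrix}, \qquad
Q(\tau) = (I + \tau L)^{-1}
  = \frac{1}{1 + 2\tau w}
    \begin{pmatrix} 1 + \tau w & \tau w \\ \tau w & 1 + \tau w \end{pmatrix}, \\
\varepsilon(\mathcal{F}_0) &= 1, \qquad \varepsilon(\mathcal{F}_1) = 2w, \qquad
s(\tau) = 1 + 2\tau w, \\
Q_0 &= I, \qquad
Q_1 = \begin{pmatrix} w & w \\ w & w \end{pmatrix}
    = \varepsilon(\mathcal{F}_1)\, \bar{J}, \qquad
\bar{J} = \tfrac{1}{2} \begin{pmatrix} 1 & 1 \\ 1 & 1 \end{pmatrix}, \\
\lim_{\tau \to \infty} Q(\tau) &= \bar{J}.
\end{align*}
```

Here $\mathcal{F}_1$ consists of the single edge rooted at either endpoint, which gives the weight $2w$, and every row of $Q_0$ and $Q_1$ sums to the corresponding $\varepsilon(\mathcal{F}_k)$.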

Remark 1. The matrix $Q_{n-v-1}$ is of special interest. Its entry $q_{n-v-1,ij}$ is the total weight of those
spanning rooted forests in $G$ that

(1) have two trees in one component of $G$ and one tree in each of the others, and

(2) have $i$ and $j$ in the same tree rooted at $i$.

Among the matrices $Q_{k}$, $k = 0, \ldots, n - v$, the matrix $Q_{n-v-1}$ is the most similar (in its properties) to the
matrices $Q(\tau)$ of relative forest accessibility. Indeed, by (15)-(16), the comparison of two entries of $Q(\tau)$ at a large $\tau$
is determined by the comparison of the corresponding entries of $Q_{n-v-1}$. Only when the latter entries are equal do
the corresponding entries of $Q_{k}$ at $k < n - v - 1$ matter. Working through examples suggests that the situations where
two entries of $Q_{n-v-1}$ are equal, whereas the corresponding entries of $Q_{k}$, $k < n - v - 1$, vary, are not frequent, and
indeed it is not easy to intuitively discriminate between the compared proximities in these cases. Still, an important
exception exists. As mentioned above, $k \ge |V_i|_{\max} - 1$ is necessary and sufficient for $Q_{k}$ to take into account all
paths in $G$. If all components of $G$, except one, are separate vertices or $G$ is connected, then $|V_i|_{\max} - 1 = n - v$. In
this case, if a pair of vertices in the nontrivial component is connected only by paths of length $n - v$ (a chain graph),
then the corresponding entry of $Q_{n-v-1}$ is zero, and so $Q_{n-v-1}$ violates the disconnection condition. Note that
some weighted sums of $Q_{n-v-1}$ and $Q_{n-v}$ are free of this flaw. Such linear combinations are studied in the following
section. Moreover, we show that $Q_{n-v-1}$ is closely connected with the matrix $L^{+}$, the Moore-Penrose generalized
inverse of $L$. More precisely, $L^{+}$ is the sum of $Q_{n-v-1}$ and $Q_{n-v}$ with definite coefficients.

8. ACCESSIBILITY VIA DENSE FORESTS CONNECTED WITH THE GENERALIZED 
INVERSION OF THE LAPLACIAN MATRIX 

This section is devoted to weighted sums of matrices $Q_{n-v-1}$ and $Q_{n-v} = \varepsilon(\mathcal{F}_{n-v})\, \bar{J}$. A number of papers
[6, 7, 9, 19] use, either explicitly or implicitly, proximity matrices whose generalization to multicomponent graphs
can be represented as $(L + \alpha \bar{J})^{-1}$, where $\alpha > 0$. The aims of this section are as follows:

(1) to provide a topological interpretation of such a proximity in the case of arbitrary multigraphs (it is
based on the matrices $Q_{n-v-1}$ and $Q_{n-v}$);

(2) to establish its relation with the matrix $L^{+}$, the Moore-Penrose generalized inverse of $L$, and

(3) to ascertain its properties.

We will show that $(L + \alpha \bar{J})^{-1}$ with a sufficiently small $\alpha$ is a weighted sum of $Q_{n-v-1}$ and $Q_{n-v}$ with
positive coefficients and satisfies a number of conditions of Sec. 2.
To solve the foregoing problems, we will need the matrix

$$\tilde{Q} = (L + \bar{J})^{-1} - \bar{J}, \tag{17}$$

which has many remarkable properties. Four representations for $\tilde{Q}$ are stated below (Propositions 7-9 and Theorem 3).

Proposition 7. For any $\alpha \ne 0$, the matrix $(L + \alpha \bar{J})$ is invertible, and
$\tilde{Q} = (L + \alpha \bar{J})^{-1} - \alpha^{-1} \bar{J}$.

By Proposition 7, the difference between $\tilde{Q}$ and $(L + \alpha \bar{J})^{-1}$ is represented by a matrix whose entries are
constant within each component of $G$. In [6, 7, 9, 19], matrices of the form $(L + \alpha \bar{J})^{-1}$ are mainly used for
transformations such as (2), where, if one disregards intercomponent entries, they can be equivalently
replaced by $\tilde{Q}$.

Recall that for any rectangular complex matrix $A$, the Moore-Penrose generalized inverse of $A$ is the unique
matrix $A^{+}$ such that

(1) $AA^{+}$ and $A^{+}A$ are Hermitian matrices,

(2) $AA^{+}A = A$, and

(3) $A^{+}AA^{+} = A^{+}$.

Proposition 8. For any weighted multigraph $G$, the matrix $\tilde{Q}$ is the Moore-Penrose generalized inverse
of $L = L(G)$, that is, $\tilde{Q} = L^{+}$.

Since $L$ is a square matrix and $LL^{+} = L^{+}L$ (which follows from the proof of Proposition 8), the matrix $\tilde{Q}$
is the group inverse of $L$ (cf. [30]). Geometric interpretations for $L^{+}$ are given in [27].

It turns out that $L^{+}$ can be obtained by a passage to the limit from the parametric matrix $Q(\tau)$ of relative
forest accessibilities (cf. Corollary 1).

Proposition 9. $L^{+} = \lim_{\tau \to \infty} \tau\, (Q(\tau) - \bar{J})$.

Proposition 9 and Theorem 2 enable one to obtain a topological interpretation for $L^{+} = (\ell_{ij}^{+})$.

THEOREM 3 (a topological interpretation for the matrix $L^{+}$, the Moore-Penrose generalized
inverse of $L$):

$$\ell_{ij}^{+} = \begin{cases} \dfrac{\varepsilon(\mathcal{F}_{n-v-1}^{ij}) - \frac{1}{|V_i|}\, \varepsilon(\mathcal{F}_{n-v-1})}{\varepsilon(\mathcal{F}_{n-v})} & \text{if } j \in V_i, \\ 0 & \text{otherwise}. \end{cases} \tag{18}$$

Here, the numerator is the result of centralization: the $ij$-entry minus the $i$th-row mean of $Q_{n-v-1}$ (see (14)).
By Theorem 3, the definition of $\bar{J}$, and Lemma 1, one has

$$L^{+} = \frac{\varepsilon(\mathcal{F}_{n-v-1})}{\varepsilon(\mathcal{F}_{n-v})} \left( \frac{1}{\varepsilon(\mathcal{F}_{n-v-1})}\, Q_{n-v-1} - \frac{1}{\varepsilon(\mathcal{F}_{n-v})}\, Q_{n-v} \right) = \frac{1}{\varepsilon(\mathcal{F}_{n-v})} \left( Q_{n-v-1} - \varepsilon(\mathcal{F}_{n-v-1})\, \bar{J} \right). \tag{19}$$

Another representation of L + for connected weighted graphs was obtained in [30] . 
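The statements about the Moore-Penrose inverse can be checked numerically on a small disconnected multigraph. The sketch below assumes vertices labelled 0..n-1 and uses exact rational arithmetic; it builds the candidate $(L + \bar{J})^{-1} - \bar{J}$, where $\bar{J}$ is the component-averaging matrix of Sec. 7 (entries $1/|V_i|$ within a component, zero elsewhere), and also evaluates $\tau (Q(\tau) - \bar{J})$ for a large $\tau$. All helper names and the example graph are hypothetical.

```python
from fractions import Fraction

def inverse(M):
    """Gauss-Jordan inverse over exact fractions; raises StopIteration if M is singular."""
    n = len(M)
    aug = [[Fraction(x) for x in row] + [Fraction(int(r == c)) for c in range(n)]
           for r, row in enumerate(M)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        piv = aug[col][col]
        aug[col] = [x / piv for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def mat_mul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def laplacian(n, edges):
    L = [[Fraction(0)] * n for _ in range(n)]
    for u, v, w in edges:
        if u != v:  # loops are ignored
            L[u][u] += w
            L[v][v] += w
            L[u][v] -= w
            L[v][u] -= w
    return L

def component_mean_matrix(n, edges):
    """Entry 1/|V_i| if j lies in the component of i, else 0."""
    label = list(range(n))
    changed = True
    while changed:  # propagate the smallest label along edges until stable
        changed = False
        for u, v, _ in edges:
            low = min(label[u], label[v])
            if label[u] != low or label[v] != low:
                label[u] = label[v] = low
                changed = True
    size = {lab: label.count(lab) for lab in set(label)}
    return [[Fraction(1, size[label[i]]) if label[i] == label[j] else Fraction(0)
             for j in range(n)] for i in range(n)]

def pinv_candidate(n, edges):
    """(L + Jbar)^{-1} - Jbar, the matrix that Proposition 8 identifies with L^+."""
    L, J = laplacian(n, edges), component_mean_matrix(n, edges)
    inv = inverse([[L[i][j] + J[i][j] for j in range(n)] for i in range(n)])
    return [[inv[i][j] - J[i][j] for j in range(n)] for i in range(n)]

def scaled_difference(n, edges, tau):
    """tau * (Q(tau) - Jbar) with Q(tau) = (I + tau L)^{-1}, cf. Proposition 9."""
    L, J = laplacian(n, edges), component_mean_matrix(n, edges)
    Q = inverse([[int(i == j) + tau * L[i][j] for j in range(n)] for i in range(n)])
    return [[tau * (Q[i][j] - J[i][j]) for j in range(n)] for i in range(n)]

# Hypothetical graph with two components: a weighted triangle and a single edge.
n = 5
edges = [(0, 1, Fraction(2)), (1, 2, Fraction(1)), (0, 2, Fraction(1, 2)),
         (3, 4, Fraction(3))]
L = laplacian(n, edges)
Lp = pinv_candidate(n, edges)
near = scaled_difference(n, edges, Fraction(10**6))
gap = max(abs(Lp[i][j] - near[i][j]) for i in range(n) for j in range(n))
print(mat_mul(mat_mul(L, Lp), L) == L, float(gap))
```

Exact fractions make the Moore-Penrose conditions testable by equality, while the limit in Proposition 9 is only approached, so the last value printed is a small positive gap rather than zero.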

Can $L^{+}$ be considered as a matrix of vertex proximities? By (18), this "proximity" equals zero for vertices
from different components of $G$, and so does the sum of "proximities" of each vertex with the vertices of the same
component. The latter does not match an intuitive idea of proximity. First, nonnegativity is violated; second, the
"proximity" of poorly connected vertices from the same component turns out to be less than that for any vertices
from different components.

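Now, let us return to the matrices $(L + \alpha J)^{-1}$. Propositions 7-9 and Eq. (19) imply the following identities:

$$(L + \alpha J)^{-1} = L^+ + \alpha^{-1} J \tag{20}$$

$$= \lim_{\tau\to\infty} \tau(Q(\tau) - J) + \alpha^{-1} J \tag{21}$$

$$= \frac{1}{\varepsilon(\mathcal{F}_{n-v})}\,Q_{n-v-1} + \frac{1}{\varepsilon(\mathcal{F}_{n-v})}\left(\alpha^{-1} - \frac{\varepsilon(\mathcal{F}_{n-v-1})}{\varepsilon(\mathcal{F}_{n-v})}\right) Q_{n-v} \tag{22}$$

$$= \frac{1}{\varepsilon(\mathcal{F}_{n-v})}\,Q_{n-v-1} + \left(\alpha^{-1} - \frac{\varepsilon(\mathcal{F}_{n-v-1})}{\varepsilon(\mathcal{F}_{n-v})}\right) J. \tag{23}$$

Thus, whenever $0 < \alpha < \varepsilon(\mathcal{F}_{n-v})/\varepsilon(\mathcal{F}_{n-v-1})$, the matrix $(L + \alpha J)^{-1}$ is the sum of $Q_{n-v-1}$ and $Q_{n-v}$ with positive coefficients. Let a dense forest be a spanning rooted forest in $G$ with $n - v$ or $n - v - 1$ edges. Then the proximity measure (22) with $0 < \alpha < \varepsilon(\mathcal{F}_{n-v})/\varepsilon(\mathcal{F}_{n-v-1})$ can be referred to as accessibility via dense forests.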
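The role of the bound on $\alpha$ can be seen on a small connected graph. This is a sketch under assumptions: the star graph, the helper `inverse`, and the function name `dense_forest_proximity` are hypothetical, and the forest weights $\varepsilon(\mathcal{F}_2) = 3$ and $\varepsilon(\mathcal{F}_1) = 4$ were counted by hand for this graph rather than computed.

```python
from fractions import Fraction

def inverse(a):
    """Gauss-Jordan inverse of a nonsingular square matrix, in exact arithmetic."""
    n = len(a)
    m = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
         for i, row in enumerate(a)]
    for c in range(n):
        p = next(r for r in range(c, n) if m[r][c] != 0)
        m[c], m[p] = m[p], m[c]
        pivot = m[c][c]
        m[c] = [x / pivot for x in m[c]]
        for r in range(n):
            if r != c:
                f = m[r][c]
                m[r] = [x - f * y for x, y in zip(m[r], m[c])]
    return [row[n:] for row in m]

idx = range(3)
# Hypothetical graph: edges (0,1) and (0,2) with unit weights (n = 3, v = 1).
L = [[2, -1, -1], [-1, 1, 0], [-1, 0, 1]]
J = [[Fraction(1, 3)] * 3 for _ in idx]

def dense_forest_proximity(alpha):
    """The matrix (L + alpha J)^{-1}."""
    return inverse([[L[i][j] + alpha * J[i][j] for j in idx] for i in idx])

def smallest_entry(matrix):
    return min(min(row) for row in matrix)

# Hand-counted for this graph (assumption of the sketch):
# eps(F_2) = 3 (one spanning tree, three roots), eps(F_1) = 4 (two edges, two roots each).
bound = Fraction(3, 4)

inside = dense_forest_proximity(Fraction(1, 2))    # alpha below the bound
at_bound = dense_forest_proximity(bound)
outside = dense_forest_proximity(Fraction(1))      # alpha above the bound

# Identity (20): (L + alpha J)^{-1} = Q + alpha^{-1} J with Q = (L + J)^{-1} - J.
Q = [[x - J[i][j] for j, x in enumerate(row)] for i, row in enumerate(dense_forest_proximity(1))]
identity_20_holds = inside == [[Q[i][j] + 2 * J[i][j] for j in idx] for i in idx]
```

In this example the smallest entry is positive for $\alpha = 1/2$, vanishes exactly at $\alpha = 3/4$, and is negative for $\alpha = 1$, which agrees with the stated range of $\alpha$.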

Proposition 10. The accessibility via dense forests in the case of multigraphs has the following properties: 
symmetry, nonnegativity, diagonal maximality, the triangle inequality for proximities, the disconnection condition, 
and the transit property. It does not satisfy monotonicity. 

It is interesting to examine the nature of the violation of monotonicity. It follows from (21) that whenever $k$
and $t$ belong to the same component of the original multigraph, monotonicity is valid in a nonstrict form, i.e., all strict
inequalities are replaced by nonstrict ones, which can be regarded as acceptable. Rough violations of monotonicity
(namely, $\Delta p_{kt} < \Delta p_{ij}$ and $\Delta p_{kt} < 0$) only occur when $k$ and $t$ originally belong to different components of $G$.
This suggests an idea of searching for a better modification of accessibility via dense forests. The scrutiny of this 
question, as well as the examination of the metric corresponding (in the sense of [29]) to this proximity measure (see 
[6, 7, 9, 30]), is beyond the scope of this paper. 

9. ON SOME PECULIARITIES OF THE PROXIMITY MEASURES 

A specific feature of path and route accessibilities is the necessity of imposing rather strong restrictions on
the weights of edges (arcs) to guarantee the properties of Sec. 2 and convergence (in the case of route accessibility).
These restrictions imply a fast decrease of proximity with movement away from a vertex along an edge chain. A 
characteristic feature of connection reliability is the effect of saturation. If, for example, two vertices are connected 
by an edge, the weight of which is close to 1, then the addition of other paths between them leaves the value of 
proximity almost the same. In addition, all diagonal entries are ones, i.e., they do not characterize self-relations of 
any kind. Accessibility via dense forests violates monotonicity when two components of a graph get a connection; 
it only satisfies the nonstrict version of monotonicity, when a graph is changed within components. Unlike relative 
forest accessibility, here the triangle inequality is also satisfied in a nonstrict form, provided that i,j, and k are 
distinct. On the other hand, the metric derived from this proximity measure by (2) coincides with the classical graph 
metric in the case of trees [6]. For a further study of this metric, see [30]. The relative forest accessibility differs 
from the other proximity measures by the very fact of its relativeness. A manifestation of this is the stochastic 
normalization property of the matrices $Q$ and $Q(\tau)$ for digraphs and doubly stochastic normalization in the case of
undirected graphs. As a corollary, the addition of new edges (arcs) in a graph does not increase all proximities; some
of them will necessarily decrease. The corresponding "absolute" proximity measure can be obtained by considering
the adjugate of the matrix $(I + \tau L)$ instead of $Q(\tau) = (I + \tau L)^{-1}$. In addition, relative forest accessibility features
macrovertex independence, which is not always desirable. To illustrate these and some other peculiarities of the
proximity measures under study, we shall consider a few simple examples.

Figure 1: Example 1.

Figure 2: Example 2.

For the graph in Fig. 1, path accessibility, connection reliability, and route accessibility give $p_{ik} < p_{it}$.
Seemingly, it would otherwise be unnatural, since $i$ and $t$ are connected not only by an edge (as $i$ and $k$ are), but
also by a path of length 2 ($iut$). Nevertheless, the relative forest accessibility gives $p_{ik} = p_{it} = p_{iu}$ (this follows from
macrovertex independence: $\{k, t, u\}$ is a macrovertex). The same result is provided by the accessibility via dense
forests. Macrovertex independence is appropriate when any connections within a macrovertex can be regarded as 
its "domestic affairs." For example, if each professor gives his/her lectures to all students (then the students form a 
macrovertex) , and the students write them down verbatim, then no reading or rewriting of the notes of each other 
can help them learn anything more (i.e., to approach the knowledge of the professors). 

The following example illustrates some peculiarities of the path and route accessibilities. In Fig. 2, $i$ is
connected with $k$ by two paths, as well as with $t$, and the weights of these paths are equal (provided that the weights
of all edges are equal). Hence, the path accessibilities $p_{ik}$ and $p_{it}$ are also equal. But the paths that connect $i$ to
$t$ have a common edge. Therefore, connection reliability gives $p_{ik} > p_{it}$. The same result holds for relative forest
accessibility and accessibility via dense forests. In contrast, route accessibility provides $p_{ik} < p_{it}$. This is because
there exist two paths of length two from $x$ to $t$ and only one path of length two from $x_1$ (or from $x_2$) to $k$. As a
result, there are eight routes of length seven from $i$ to $t$ and only four routes of length seven from $i$ to $k$.

Furthermore, the proximity measures at hand behave differently as applied to cycles. The cycle in Fig. 3
has no influence on the values of path accessibility and connection reliability between $i$ and $t$, i.e., $p_{it} = p_{ik}$ (if all
edge weights are equal). Using route accessibility, we have $p_{it} > p_{ik}$. At the same time, relative forest accessibility
provides $p_{it} < p_{ik}$, as the approach of $i$ and $t$ to the vertices of the cycle (owing to its appearance) moves them away
(in the relative account) from each other. The same holds for the accessibility via dense forests.

Note finally that for path accessibility, connection reliability, and the measures representable by weighted
sums of the matrices $Q_1, \ldots, Q_{n-v}$ with fixed weights, the values of proximity linearly depend on the weights of
edges (arcs), whereas for the other measures at hand, this is not the case.

Thus, the proximity measures under discussion have significantly different properties. At the same time,
"almost all" of them possess "almost all" of the "basic" properties formulated in Sec. 2.

Figure 3: Example 3.

| Property                            | Paths | Reliability | Routes | Forests (undirected) | Dense forests (undirected) |
|-------------------------------------|-------|-------------|--------|----------------------|----------------------------|
| Symmetry                            | +     | +           | +      | +                    | +                          |
| Nonnegativity                       | +     | +           | +      | +                    | +                          |
| Reversal property                   | +     | +           | +      | x                    | x                          |
| Diagonal maximality                 | +*    | +*          | +      | +                    | +                          |
| Triangle inequality for proximities | +*    | +*          | +**    | +                    | +                          |
| Disconnection condition             | +     | +           | +      | +                    | +                          |
| Transit property                    | +*    | +*          | +      | +                    | +                          |
| Monotonicity (1)                    | +*    | +*          | +      | +                    | −                          |
| Monotonicity (2)                    | +     | +*          | +      | +                    | −                          |
| Monotonicity (3)                    | +     | +           | −      | +                    | −                          |

+* is valid under some restriction and/or in the nonstrict form.

+** was proved under an additional constraint.

x inapplicable, as only undirected graphs were considered.

Table 1: Some properties of proximity measures for graph vertices.

10. CONCLUSION

In this paper, we have dealt with several proximity measures for the vertices of directed and undirected
multigraphs and considered their properties. These properties and the informal discussion of the previous section
can help one choose adequate proximity measures when exact mathematical models are lacking.

A common feature of the indices considered in this paper is the measurement of the proximity (accessibility, 
connectivity) of two vertices by the total weight of certain substructures that "connect" these vertices. As such 
substructures, we examined paths (in particular, taking into account their overlaps), routes, spanning rooted forests, 
and "dense" spanning rooted forests. The weight of a substructure was defined as the product of the weights of the 
constituent edges (arcs). Within this approach, a proportional modification of all edge weights is needed in some 
cases, as well as assigning the same weight to all edges (arcs) of unweighted graphs. In conclusion, let us indicate 
some proximity measures that do not enter into the scope of the present paper. These are the indices dual (in the 
sense of [29]) to the classical distance for connected graphs and to some nonclassical graph metrics [7], maximum flow 
(minimum cut) between vertices [9], and a number of measures related to random walks in graphs (see [32, 9, 7]). 



APPENDIX 



Proof of Proposition 1. Symmetry, nonnegativity, reversal property, and disconnection condition
immediately follow from the definition of path accessibility. To prove the remaining properties, let us find $\varepsilon_0$
guaranteeing that whenever the weights of all edges (arcs) are less than $\varepsilon_0$, $p_{ij} < 1$ holds for all $i$ and $j \ne i$. Let $m$
be the greatest possible number of edges (arcs) incident to the same pair of vertices. Note that when a multigraph
$G$ is complete (i.e., exactly $m$ edges are incident to each pair of vertices), and the weights of all edges are $\varepsilon$, then at
$j \ne i$,

$$p_{ij} = \sum_{k=1}^{n-1} A_{n-2}^{k-1} (\varepsilon m)^k,$$

where $A_{n-2}^{k-1}$ is the number of permutations of $n - 2$ things taken $k - 1$ at a time. Now,
we equate this expression to unity and assign to $\varepsilon_0$ the positive root of the equation obtained.
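The threshold $\varepsilon_0$ has no closed form in general, but it is easy to approximate. The sketch below is one possible way to do so, assuming bisection is acceptable; the function names `complete_path_accessibility` and `epsilon_0` and the tolerance are illustrative choices, not taken from the paper.

```python
from math import perm

def complete_path_accessibility(x, n):
    """p_ij (j != i) in the complete multigraph, as a polynomial in x = eps * m."""
    return sum(perm(n - 2, k - 1) * x**k for k in range(1, n))

def epsilon_0(n, m, tol=1e-12):
    """Positive root of p_ij = 1, found by bisection on x = eps * m, then divided by m.

    The polynomial is increasing on [0, 1], equals 0 at x = 0, and is at least 1
    at x = 1 (its first coefficient is 1), so the root lies in (0, 1].
    """
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if complete_path_accessibility(mid, n) < 1:
            lo = mid
        else:
            hi = mid
    return lo / m

eps_simple = epsilon_0(n=4, m=1)   # graphs on 4 vertices without multiple edges
eps_double = epsilon_0(n=4, m=2)   # up to two parallel edges per pair of vertices
```

For $n = 4$ the polynomial is $x + 2x^2 + 2x^3$, so the bound on the edge weights is noticeably below $1/2$ even without multiple edges, and it scales as $1/m$.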

Henceforth, we will assume that $\varepsilon_{ij}^p < \varepsilon_0$ for all edge weights $\varepsilon_{ij}^p$. As $p_{ij}$ is maximal in a complete multigraph,
this will guarantee

$$p_{ij} < 1, \quad i, j = 1, \ldots, n, \quad i \ne j, \tag{24}$$

for all weighted multigraphs on $n$ vertices with the number of multiple edges not greater than $m$. The same constraint
can be obtained for the weights of arcs in multidigraphs.

Diagonal maximality follows from the inequalities $p_{ii} \ge 1$ and (24).

Prove the triangle inequality for proximities. At $i = j$ or $i = k$, the inequality reduces to equality. Suppose
that $i \ne j$ and $i \ne k$. Note that whenever all paths from $j$ to $k$ pass through $i$, $p_{jk} = p_{ji} p_{ik}$ holds; otherwise
$p_{jk} \ge p_{ji} p_{ik}$. Let $C$ be the total weight of simple cycles from $i$ to $i$; then $p_{ii} = 1 + C$. Using (24), one obtains the
triangle inequality for proximities:

$$p_{ji} + p_{ik} - p_{jk} - p_{ii} \le p_{ji} + p_{ik} - p_{ji} p_{ik} - 1 - C = (p_{ji} - 1)(1 - p_{ik}) - C < 0.$$

To prove the transit property, note that $p_{it} = p_{ik} p_{kt}$, and using (24), we have $p_{it} < p_{ik}$.

Now prove monotonicity. Item 1. Suppose that $\Delta\varepsilon_{kt}$ is the increment of the weight of an existing edge or the
weight of a new edge between $k$ and $t$. Then $\Delta p_{kt} = \Delta\varepsilon_{kt} > 0$. Let us show that whenever all edge weights are smaller
than $\varepsilon_0$ and $\{i, j\} \ne \{k, t\}$, $\Delta p_{ij} < \Delta\varepsilon_{kt}$ holds. If $i = k$, then $\Delta p_{ij} = \Delta p_{kj} \le \Delta\varepsilon_{kt}\, p_{tj}$, and the required inequality
follows from (24). The cases $i = t$, $j = k$, and $j = t$ are similar. It remains to consider the case $\{i, j\} \cap \{k, t\} = \emptyset$, in
which $n \ge 4$. Obviously $\Delta p_{ij} = \Delta\varepsilon_{kt}\, w$, where $w$ is the total $(k, t)$-weight of the paths from $i$ to $j$ that contain the
new (reweighted) edge $(kt)$, and the "$(k, t)$-weight" of a path is the product of the weights of all its edges, except
for the edge $(kt)$. Prove that $w < 1$. Obviously, $w$ is maximal in a complete multigraph, where, as is easy to check,

$$w = 2\sum_{k=2}^{n-2} (k - 1) A_{n-4}^{k-2} (\varepsilon_0 m)^k.$$

Let us show that in this case, $w$ is less than the value

$$p = \sum_{k=1}^{n-1} A_{n-2}^{k-1} (\varepsilon_0 m)^k$$

of the proximity for two distinct vertices in a complete multigraph, which equals 1 by the definition of $\varepsilon_0$. Juxtapose the
coefficients at the same exponents of $(\varepsilon_0 m)$ in the expressions for $w$ and $p$. It is easy to verify that the inequality
$2(k - 1) A_{n-4}^{k-2} \ge A_{n-2}^{k-1}$ has a unique solution: $n = 4$, $k = 2$. Thereby, the statement is proved in the case of $n > 4$.
Finally, for $n = 4$ we have $p = \varepsilon_0 m + 2(\varepsilon_0 m)^2 + 2(\varepsilon_0 m)^3$ and $w = 2(\varepsilon_0 m)^2$; therefore, $w < p$ as well. A similar
proof applies to multidigraphs.

Item 2. The statement follows from $\Delta p_{ik} = 0$ and $\Delta p_{it} > 0$.

Item 3. We have $\Delta p_{i_1 i_2} = 0$, as the edge (arc) $(kt)$ does not belong to any path from $i_1$ to $i_2$. □

Proof of Proposition 2. Symmetry, nonnegativity, reversal property, and disconnection condition follow
easily from the definition of connection reliability. Diagonal maximality in a nonstrict version follows from the facts
that $p_{ii} = 1$ and $p_{ij} \le 1$, $i, j = 1, \ldots, n$. If all edge/arc weights are less than 1, then, obviously, $p_{ij} < 1$ at $j \ne i$;
therefore $p_{ij} < p_{ii}$.

The proof of the triangle inequality for proximities mimics the corresponding proof for path accessibility.

Transit property (in the form specified in Proposition 2) follows from the equality $p_{it} = p_{ik} p_{kt}$, which is valid
under the hypothesis of this property.

Prove item 1 of monotonicity for multidigraphs. This proof will also be applicable to multigraphs. Let a 
state of a multidigraph, all of whose arcs are assigned some intactness probabilities, be any of its spanning subgraphs. 
The arcs of the subgraph are interpreted as the only intact arcs of the original multidigraph. By the assumption of 
independence of failures, the probability of a state is the product of the intactness probabilities of the arcs entering 
into the state and the failure probabilities of the lacking arcs. Let a new arc from $k$ to $t$ be added. Note that $\Delta p_{ij}$
is the total probability of those states in which

(1) the new arc (kt) is present, 

(2) there is a path from i to j, and 

(3) the removal of the arc (kt) leaves no path from i to j. 

Note that in all these states, the removal of $(kt)$ does not leave any path from $k$ to $t$ either (otherwise the
removal of this arc would not have broken a path from $i$ to $j$). Therefore, the specified total probability is a summand
of $\Delta p_{kt}$, and hence, $\Delta p_{kt} \ge \Delta p_{ij}$. Whenever all arc weights are strictly less than 1, there is at least one state whose
nonzero probability is a summand of $\Delta p_{kt}$, but does not enter into $\Delta p_{ij}$: in this state the new arc $(kt)$ is solely intact,
and the desired inequality is strict. All these conclusions are preserved when the weight of an arc $(kt)$ increases.
This is because the connection reliability is affinely related with each arc weight.
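The state-based argument above translates directly into a brute-force computation. The following sketch enumerates all states of a small multidigraph; the example arcs, their intactness probabilities, and the names `reachable`, `reliability`, and `increment` are assumptions made for illustration, and the exhaustive enumeration is only practical for a handful of arcs.

```python
from fractions import Fraction
from itertools import product

def reachable(arcs, src, dst):
    """True if dst can be reached from src along the given intact arcs."""
    seen, stack = {src}, [src]
    while stack:
        u = stack.pop()
        for tail, head in arcs:
            if tail == u and head not in seen:
                seen.add(head)
                stack.append(head)
    return dst in seen

def reliability(weighted_arcs, src, dst):
    """Total probability of the states that contain a path from src to dst."""
    total = Fraction(0)
    for state in product((False, True), repeat=len(weighted_arcs)):
        prob, intact = Fraction(1), []
        for keep, (arc, w) in zip(state, weighted_arcs):
            prob *= w if keep else 1 - w      # independent failures
            if keep:
                intact.append(arc)
        if reachable(intact, src, dst):
            total += prob
    return total

half = Fraction(1, 2)
old = [((0, 1), half), ((2, 3), half)]        # hypothetical multidigraph
new = old + [((1, 2), half)]                  # new arc from k = 1 to t = 2

def increment(i, j):
    return reliability(new, i, j) - reliability(old, i, j)

delta_kt = increment(1, 2)
delta_03 = increment(0, 3)
delta_02 = increment(0, 2)
```

In this example the increment for the pair $(k, t)$ strictly exceeds the increments for the other pairs, as item 1 of monotonicity requires when all weights are below 1.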

Item 2 of monotonicity is true, since $\Delta p_{ik} = 0$, $\Delta p_{it} \ge 0$, and $\Delta p_{it} > 0$ when all arc/edge weights are strictly
less than one. Item 3 is valid, as the edge (arc) $(kt)$ does not belong to any path from $i_1$ to $i_2$. □

Proof of Proposition 3. Symmetry, nonnegativity, reversal property, and disconnection condition follow 
from the definition of route accessibility. 

Prove diagonal maximality for multidigraphs. In talking about route accessibility, we always consider a
family of graphs with a specified greatest possible number of multiple edges (arcs) $m$ and with edge/arc weights
smaller than $\varepsilon_{\max} = (m(n - 1))^{-1}$. Suppose that $\Gamma$ is a weighted multidigraph that belongs to such a family; $i$ and
$j \ne i$ are arbitrary vertices of $\Gamma$; $\varepsilon < \varepsilon_{\max}$ is the maximum among the arc weights in $\Gamma$. Consider the multidigraph
$\Gamma'$ constructed by removing all arcs directed to $i$ from the complete multidigraph with the multiplicity of all arcs $m$
and the weight of all arcs $\varepsilon$. Obviously, for $\Gamma'$, $p'_{ii} = 1$, and $p'_{ij} = p'_{ik}$ for any $k \ne i$. Particularizing the equality
$P'(I - E') = I$ for the $ij$-entry of $P'(I - E')$, we derive

$$p'_{ij} = \frac{\varepsilon m}{1 - (n - 2)\varepsilon m},$$

and consequently, $p'_{ii} > p'_{ij}$, since $\varepsilon < (m(n - 1))^{-1}$. If some arcs are removed from $\Gamma'$ or some weights of arcs are reduced (let the resulting graph
be $\Gamma''$), then $p_{ii}$ does not change, whereas $p_{ij}$ can only decrease. Now, let an arc from $k \ne i$ to $i$ be added to $\Gamma''$.
By virtue of Proposition 4 (the proof of which is given below), in this case $\Delta p_{ii} - \Delta p_{ij} = h\, p''_{ik} (p''_{ii} - p''_{ij}) \ge 0$, and
thus, $p_{ii} > p_{ij}$ remains true. Similarly, $p_{ii} > p_{ij}$ is preserved at the consecutive addition of other arcs directed to $i$.
Hence, $p_{ii} > p_{ij}$ is also valid for $\Gamma$, and the diagonal maximality is proved. The fulfillment of this property for any
multigraph $G$ is ensured by its validity for the symmetric multidigraph $\Gamma$ with the same matrix $E$.

Now we prove triangle inequality for proximities in the case where the weights of all edges (arcs) do not
exceed $(mn)^{-1}$. First, consider the digraph $\Gamma'$ that differs from the complete digraph by the lack of all arcs directed
to $i$. At $j = i$ or $k = i$, the triangle inequality for proximities reduces to equality, so assume that $j \ne i$ and $k \ne i$.
Let each arc of $\Gamma'$ have weight $\varepsilon = 1/n$. Using the equality $(I - E')P' = I$ for the entries $ij$, $ik$, and $ii$ of $(I - E')P'$,
one obtains

$$p'_{ij} = p'_{ik} = \frac{\varepsilon}{1 - (n - 2)\varepsilon} = \frac{1}{2}, \qquad p'_{ii} = 1,$$

hence, $p'_{ii} - p'_{ij} - p'_{ik} + p'_{jk} > p'_{ii} - p'_{ij} - p'_{ik} = 0$. We shall prove now that no change of $\Gamma'$ can decrease $p_{ii} - p_{ij} - p_{ik}$.
Indeed, if some arcs are removed from $\Gamma'$ and/or the weights of some arcs are reduced, $p_{ii}$ does not change, whereas
$p_{ij}$ and $p_{ik}$ can only decrease; therefore, $p_{ii} - p_{ij} - p_{ik} \ge 0$ is preserved. Furthermore, if for some digraph $\Gamma$ this
inequality is valid, then the addition of any arc $(ti)$ to $\Gamma$ cannot violate it, since, by Proposition 4,

$$\Delta p_{ii} - \Delta p_{ij} - \Delta p_{ik} = h(t)\, p_{it} (p_{ii} - p_{ij} - p_{ik}) \ge 0.$$

Thus, the triangle inequality for proximities is valid for any digraph. The fulfillment of this property for multidigraphs
is proved by replacing the set of arcs between a pair of vertices with a single arc with the total weight, which reduces
the problem to digraphs. The fulfillment of the property for any multigraph is ensured by its validity for the
symmetric multidigraph with the same matrix $E$.

Transit property for multidigraphs will be proved by contradiction. Let $\Gamma$ be the multidigraph with the
minimum number of arcs among the multidigraphs that violate the transit property. Then $\Gamma$ has a path from $i$ to
$k$, $t \ne k$, and any path from $i$ to $t$ contains $k$, but $p_{ik} \le p_{it}$. From the diagonal maximality, $k \ne i$. Let $(ij)$ be
the first arc of an arbitrary path from $i$ to $k$, and let $\Gamma'$ be the multidigraph obtained by removing the arcs $(ij)$
from $\Gamma$. Then, after adding the arc $(ij)$ to $\Gamma'$, one has $\Delta p_{it} \ge \Delta p_{ik}$. Indeed, if $\Gamma'$ has no path from $i$ to $k$, then
$p_{ik} = p_{it} = 0$ in $\Gamma'$, and $\Delta p_{it} < \Delta p_{ik}$ would have been in contradiction with $p_{ik} \le p_{it}$ in $\Gamma$. If, otherwise, $\Gamma'$ contains
a path from $i$ to $k$ and $\Delta p_{it} < \Delta p_{ik}$, then $\Gamma'$ violates the transit property, which contradicts the minimality of $\Gamma$.
Further, by Proposition 4, $\Delta p_{it} - \Delta p_{ik} = h\, p'_{ii} (p'_{jt} - p'_{jk})$, where $h > 0$, and $\Delta p_{it} \ge \Delta p_{ik}$ implies $p'_{jt} \ge p'_{jk}$. By the
construction, $\Gamma'$ has a path from $j$ to $k$, and any path from $j$ to $t$ contains $k$. Hence, $\Gamma'$ breaks the transit property,
which contradicts the minimality of $\Gamma$. Transit property for any multigraph is proved by turning to the multidigraph
with the same matrix $E$.

To prove item 1 of monotonicity in the case of multidigraphs, note that, by virtue of Proposition 4, $\Delta p_{kt} = h\, p_{kk} p_{tt}$
and $\Delta p_{ij} = h\, p_{ik} p_{tj}$. Now, the required statement follows from the diagonal maximality and can be extended
to multigraphs by a standard trick. Similarly, item 2 of monotonicity follows from the formula
$\Delta p_{it} - \Delta p_{ik} = h(p_{ik} p_{tt} - p_{ik} p_{tk})$ and diagonal maximality. Item 3 is not true, since under the hypothesis of monotonicity, some
routes from $i_1$ to $i_2$ that contain the edge (arc) $(kt)$ can appear or increase their weight. □

Proof of Proposition 4. Let $\Delta(I - E) = (I - E') - (I - E)$. Note that $\Delta(I - E) = XY$, where $X = (x_{i1})$,
$i = 1, \ldots, n$, is the column vector with entries $x_{k1} = -\Delta\varepsilon_{kt}$ and $x_{i1} = 0$ for all $i \ne k$; $Y = (y_{1j})$, $j = 1, \ldots, n$, is the
row vector with entries $y_{1t} = 1$ and $y_{1j} = 0$ for all $j \ne t$. According to [33, Sec. 0.7.4],

$$P' = P - \frac{1}{1 + YPX}\, PXYP.$$

It is straightforward to verify that $-(1 + YPX)^{-1} = -h/\Delta\varepsilon_{kt}$ and $PXYP = -\Delta\varepsilon_{kt} R$, and thereby the proposition is
proved. □
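The rank-one update used in this proof can be checked directly. The sketch below assumes that $P = (I - E)^{-1}$, that $R = (p_{ik} p_{tj})$, and that $h = \Delta\varepsilon_{kt}/(1 - \Delta\varepsilon_{kt}\, p_{tk})$, which is what the update formula yields for the $X$ and $Y$ above; the statement of Proposition 4 is not reproduced here, so these readings of $h$ and $R$, the example digraph, and the helper names are assumptions.

```python
from fractions import Fraction

def inverse(a):
    """Gauss-Jordan inverse of a nonsingular square matrix, in exact arithmetic."""
    n = len(a)
    m = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
         for i, row in enumerate(a)]
    for c in range(n):
        p = next(r for r in range(c, n) if m[r][c] != 0)
        m[c], m[p] = m[p], m[c]
        pivot = m[c][c]
        m[c] = [x / pivot for x in m[c]]
        for r in range(n):
            if r != c:
                f = m[r][c]
                m[r] = [x - f * y for x, y in zip(m[r], m[c])]
    return [row[n:] for row in m]

def route_accessibility(E):
    """P = (I - E)^{-1} for a matrix E of arc weights (E[i][j] is the weight of arc i -> j)."""
    n = len(E)
    return inverse([[int(i == j) - E[i][j] for j in range(n)] for i in range(n)])

q = Fraction(1, 4)
E = [[0, q, 0], [0, 0, q], [q, 0, 0]]     # hypothetical directed cycle 0 -> 1 -> 2 -> 0
k, t, delta = 0, 2, Fraction(1, 4)        # new arc from k to t with weight delta

P = route_accessibility(E)
E_new = [row[:] for row in E]
E_new[k][t] += delta
P_new = route_accessibility(E_new)

# Increment predicted by the rank-one update: Delta p_ij = h * p_ik * p_tj.
h = delta / (1 - delta * P[t][k])
predicted = [[P[i][j] + h * P[i][k] * P[t][j] for j in range(3)] for i in range(3)]
```

Recomputing the inverse from scratch and applying the update give the same matrix in this example, so each increment $\Delta p_{ij}$ is proportional to $p_{ik} p_{tj}$.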

Proof of Proposition 5. Let us prove item 1 of monotonicity (all the other statements are proved in
[25]). By item 1 of Proposition 7 from [25], $\Delta p_{kt} = h(p_{kk} - p_{kt})(p_{tt} - p_{tk})$ and $\Delta p_{ij} = h(p_{ik} - p_{it})(p_{jt} - p_{jk})$, where
$h > 0$. Diagonal maximality implies $\Delta p_{kt} > 0$. If $\Delta p_{ij} > 0$, then $(p_{ik} - p_{it})(p_{jt} - p_{jk}) > 0$. For definiteness, we assume
that $p_{ik} - p_{it} > 0$ and $p_{jt} - p_{jk} > 0$ (the complementary case is treated similarly). Then, by item 2 of Proposition 6
from [25], if $i \ne k$, then $G$ contains a path from $i$ to $k$, such that the difference $(p_{uk} - p_{ut})$ strictly increases as $u$
progresses from $i$ to $k$ along the path. Hence, $p_{kk} - p_{kt} > p_{ik} - p_{it}$. Similarly, $p_{tt} - p_{tk} > p_{jt} - p_{jk}$ whenever $j \ne t$.
Using the above expressions for $\Delta p_{kt}$ and $\Delta p_{ij}$, we get $\Delta p_{kt} > \Delta p_{ij}$. □

Proof of Theorem 2. Equation (13) follows from the matrix-forest theorem [25] applied to the weighted
multigraph $G'$ that differs from $G$ by the weights of edges only: for all $i, j = 1, \ldots, n$ and $p = 1, \ldots, a_{ij}$, $(\varepsilon_{ij}^p)' = \tau \varepsilon_{ij}^p$. □

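Proof of Proposition 6. This equality holds by virtue of the following three facts, which are true for
any $k = 0, \ldots, n - v$ and for any $i, j, i_1, i_2 = 1, \ldots, n$ such that $i_1 \ne i_2$: (1) $\mathcal{F}_k = \bigcup_{i=1}^{n} \mathcal{F}_k^{ij}$, (2) $\mathcal{F}_k^{i_1 j} \cap \mathcal{F}_k^{i_2 j} = \emptyset$, and
(3) $\varepsilon(\mathcal{F}_k^{ij}) = \varepsilon(\mathcal{F}_k^{ji})$. □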

Proof of Lemma 1. Let $j \in V_i$. The desired statement follows from the following fact: each spanning
rooted forest from $\mathcal{F}^{ij}_{n-v}$ can be put into correspondence with $|V_i|$ spanning rooted forests from $\mathcal{F}_{n-v}$: the latter
forests have the same weight each and only differ by the root in the component that contains $i$; each element of $\mathcal{F}_{n-v}$
enters the correspondence exactly once. For $j \notin V_i$, the statement follows from $\mathcal{F}^{ij}_{n-v} = \emptyset$. □

Proof of Proposition 7. First, we prove that for any $\alpha \ne 0$, $\det(L + \alpha J) \ne 0$. As the matrix $L + \alpha J$ is
reducible to a block-diagonal form, where the blocks correspond to the connected components of $G$, it suffices to
prove its nonsingularity in the case of connected multigraphs (including the multigraph with one vertex and without
edges, the point graph). Assume, on the contrary, that for some connected multigraph $G$, $\det(L + \alpha J) = 0$. Then
there exists a vector $b = (b_1, \ldots, b_n)^T \ne \mathbf{0}$ such that $(L + \alpha J)b = \mathbf{0}$, where $\mathbf{0} = (0, \ldots, 0)^T$. Note that the entries of
$Lb$ sum to zero, whereas the entries of $\alpha Jb$ are all equal. Therefore, $Lb = \alpha Jb = \mathbf{0}$. It follows from $Lb = \mathbf{0}$ that
$b_1 = b_2 = \ldots = b_n$, hence, by $\alpha Jb = \mathbf{0}$, we have $b = \mathbf{0}$. This contradiction proves the invertibility of $L + \alpha J$. To
complete the proof, we will need a simple lemma.

LEMMA 2. For any matrices $A$ and $B$, if $A$ and $B$ are invertible and $AJ = JB = \alpha J$ ($\alpha \in \mathbb{R}$, $\alpha \ne 0$),
then $A^{-1}J = JB^{-1} = \alpha^{-1}J$.

Proof of Lemma 2. Premultiplying $AJ = \alpha J$ by $A^{-1}$ yields $J = \alpha A^{-1}J$. The statement regarding the
matrix $B$ is proved similarly. □
Note that the following equalities hold true:

$$JL = LJ = 0, \tag{25}$$

$$J^2 = J, \tag{26}$$

and, by Lemma 2 and Theorem 2, for any $\tau > 0$,

$$(I + \tau L)^{-1} J = J, \tag{27}$$

$$(L + J)^{-1} J = J. \tag{28}$$

Using Eqs. (25), (26), and (28), we obtain

$$QL = (L + J)^{-1}L - JL = (L + J)^{-1}(L + J - J) = I - (L + J)^{-1}J = I - J, \tag{29}$$

$$QJ = (L + J)^{-1}J - J^2 = 0. \tag{30}$$

Consequently, for any $\alpha \ne 0$, we have

$$(Q + \alpha^{-1}J)(L + \alpha J) = I - J + J = I,$$

whence $Q + \alpha^{-1}J = (L + \alpha J)^{-1}$. □



Proof of Proposition 8. By (29), $QL = I - J$. Similarly, $LQ = I - J$. Thus, the first condition in the
definition of the Moore-Penrose generalized inverse is checked. Next, using Lemma 2, (25), and (26), we have

$$LQL = L(I - J) = L,$$

$$QLQ = (I - J)Q = Q - JQ = Q - J(L + J)^{-1} + J^2 = Q - J + J = Q,$$

which completes the proof. □
Proof of Proposition 9 reduces to the following transformations based on Eqs. (25)-(28) and Corollary 1:

$$\Bigl(\lim_{\tau\to\infty} \tau\bigl((I + \tau L)^{-1} - J\bigr) + J\Bigr)(L + J)$$

$$= \lim_{\tau\to\infty} \tau\bigl((I + \tau L)^{-1}L + (I + \tau L)^{-1}J - JL - J^2\bigr) + JL + J^2$$

$$= \lim_{\tau\to\infty} \tau(I + \tau L)^{-1}L + J = \lim_{\tau\to\infty} (I + \tau L)^{-1}(I + \tau L - I) + J$$

$$= I - \lim_{\tau\to\infty} (I + \tau L)^{-1} + J = I.$$

It now remains to apply Proposition 8. □

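Proof of Theorem 3. For $j \notin V_i$, the statement follows from Theorem 2, Proposition 9, and the definition
of $J$. For $j \in V_i$, using the same and Lemma 1, we have

$$\ell^+_{ij} = \lim_{\tau\to\infty} \tau\left(\frac{\sum_{k=0}^{n-v} \tau^k \varepsilon(\mathcal{F}^{ij}_k)}{\sum_{k=0}^{n-v} \tau^k \varepsilon(\mathcal{F}_k)} - \frac{1}{|V_i|}\right) = \lim_{\tau\to\infty} \frac{\sum_{k=0}^{n-v-1} \tau^{k+1}\bigl(\varepsilon(\mathcal{F}^{ij}_k) - \frac{1}{|V_i|}\varepsilon(\mathcal{F}_k)\bigr)}{\sum_{k=0}^{n-v} \tau^k \varepsilon(\mathcal{F}_k)} = \frac{\varepsilon(\mathcal{F}^{ij}_{n-v-1}) - \frac{1}{|V_i|}\varepsilon(\mathcal{F}_{n-v-1})}{\varepsilon(\mathcal{F}_{n-v})}. \quad □$$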

Proof of Proposition 10. Symmetry, nonnegativity, and disconnection condition follow from (23).

Let us prove diagonal maximality. The matrix $J$ possesses this property in the nonstrict version $p_{ii} \ge p_{ij}$;
therefore, by virtue of (23), it suffices to prove it for $Q_{n-v-1}$. By definition, for all $i, j = 1, \ldots, n$, $q_{n-v-1,ij} = \varepsilon(\mathcal{F}^{ij}_{n-v-1})$
holds, where $\mathcal{F}^{ij}_{n-v-1}$ is the set of all spanning rooted forests in $G$ that contain $n - v - 1$ edges and have
$i$ and $j$ in the same tree rooted at $i$. Obviously, $\mathcal{F}^{ij}_{n-v-1} \subseteq \mathcal{F}^{ii}_{n-v-1}$. Show that $\mathcal{F}^{ii}_{n-v-1} \setminus \mathcal{F}^{ij}_{n-v-1} \ne \emptyset$. Consider an
arbitrary $F \in \mathcal{F}^{ij}_{n-v}$, remove from $F$ any edge that belongs to the path from $i$ to $j$, and arbitrarily choose the root in
the newly formed component containing $j$. The resulting subgraph belongs to $\mathcal{F}^{ii}_{n-v-1} \setminus \mathcal{F}^{ij}_{n-v-1}$. By the assumption
of positivity of the edge weights, we have $\varepsilon(\mathcal{F}^{ii}_{n-v-1}) > \varepsilon(\mathcal{F}^{ij}_{n-v-1})$, whence $q_{n-v-1,ii} > q_{n-v-1,ij}$, and the property
is proved. Note that diagonal maximality can be similarly proved for $Q_1, \ldots, Q_{n-v-2}$; for $Q_0$ it is obvious, whereas
for $Q_{n-v} = \varepsilon(\mathcal{F}_{n-v}) J$ it is valid in a nonstrict version.

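Prove the triangle inequality for proximities. The strict statement (for $j = k$ and $i \ne j$) follows from the
diagonal maximality. Prove that $p_{ij} + p_{ik} - p_{jk} \le p_{ii}$. For $i = j$ or $i = k$, we have the identity. Suppose that $i \ne j$
and $i \ne k$. Obviously, $\mathcal{F}^{ij}_{n-v-1} \cup \mathcal{F}^{ik}_{n-v-1} \subseteq \mathcal{F}^{ii}_{n-v-1}$. Hence,

$$\varepsilon(\mathcal{F}^{ij}_{n-v-1}) + \varepsilon(\mathcal{F}^{ik}_{n-v-1}) - \varepsilon(\mathcal{F}^{ij}_{n-v-1} \cap \mathcal{F}^{ik}_{n-v-1}) = \varepsilon(\mathcal{F}^{ij}_{n-v-1} \cup \mathcal{F}^{ik}_{n-v-1}) \le \varepsilon(\mathcal{F}^{ii}_{n-v-1}). \tag{31}$$

Define $\mathcal{F}^{ijk}_{n-v-1}$ as $\mathcal{F}^{ij}_{n-v-1} \cap \mathcal{F}^{ik}_{n-v-1}$ and note that $\mathcal{F}^{ijk}_{n-v-1}$ differs from $\mathcal{F}^{jik}_{n-v-1} = \mathcal{F}^{ji}_{n-v-1} \cap \mathcal{F}^{jk}_{n-v-1}$ only by
the roots of the trees that contain $i$, $j$, and $k$ simultaneously. Therefore,

$$\varepsilon(\mathcal{F}^{ij}_{n-v-1} \cap \mathcal{F}^{ik}_{n-v-1}) = \varepsilon(\mathcal{F}^{ijk}_{n-v-1}) = \varepsilon(\mathcal{F}^{jik}_{n-v-1}) \le \varepsilon(\mathcal{F}^{jk}_{n-v-1}). \tag{32}$$

Summing up the extreme left and extreme right parts of (31) and (32), we obtain

$$\varepsilon(\mathcal{F}^{ij}_{n-v-1}) + \varepsilon(\mathcal{F}^{ik}_{n-v-1}) \le \varepsilon(\mathcal{F}^{ii}_{n-v-1}) + \varepsilon(\mathcal{F}^{jk}_{n-v-1}),$$

which, by the definitions of $Q_{n-v-1}$ and $J$ and (23), implies the triangle inequality for proximities.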

Prove transit property. The required inequality is valid for the matrix $J$ in a nonstrict form, so by virtue
of (23), it remains to prove it for $Q_{n-v-1}$. Obviously, $\mathcal{F}^{it}_{n-v-1} \subseteq \mathcal{F}^{ik}_{n-v-1}$. To prove that $\mathcal{F}^{ik}_{n-v-1} \setminus \mathcal{F}^{it}_{n-v-1} \ne \emptyset$,
consider an arbitrary $F \in \mathcal{F}^{it}_{n-v}$. Remove from $F$ any edge that belongs to the path from $k$ to $t$ and arbitrarily
choose the root in the newly formed component containing $t$. The resulting subgraph belongs to $\mathcal{F}^{ik}_{n-v-1} \setminus \mathcal{F}^{it}_{n-v-1}$.
By the assumption of positivity of the edge weights, we conclude that $\varepsilon(\mathcal{F}^{ik}_{n-v-1}) > \varepsilon(\mathcal{F}^{it}_{n-v-1})$, and the property is
proved.

To demonstrate the violation of monotonicity, it is sufficient to consider the graph $G$ with the vertex set
$V(G) = \{1, 2, 3\}$ and one edge $(1, 2)$ whose weight is unity. Let an edge $(1, 3)$ with weight unity be added to $G$.
Here, the accessibility via dense forests provides (for any $\alpha \ne 0$) $\Delta p_{13} = -1/9 < 5/36 = \Delta p_{12}$ (which violates item 1
of monotonicity) and $\Delta p_{23} = -4/9 < 5/36 = \Delta p_{21}$ (which violates item 2). With the same example, item 3 is also
trivially violated, as $\Delta p_{22} = 11/36 > 0$. By adding an appropriate number of isolated vertices, similar examples can
be generated for all $n$.
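The figures in this example can be reproduced with exact arithmetic. The sketch below rests on an assumption about how they were obtained: it computes the increments of the entries of $L^+ = (L + J)^{-1} - J$ only and does not include the term $\alpha^{-1} J$ of $(L + \alpha J)^{-1}$. The helper names and the 0-based vertex numbering (vertex $i$ of the text is index $i - 1$) are illustrative.

```python
from fractions import Fraction

def inverse(a):
    """Gauss-Jordan inverse of a nonsingular square matrix, in exact arithmetic."""
    n = len(a)
    m = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
         for i, row in enumerate(a)]
    for c in range(n):
        p = next(r for r in range(c, n) if m[r][c] != 0)
        m[c], m[p] = m[p], m[c]
        pivot = m[c][c]
        m[c] = [x / pivot for x in m[c]]
        for r in range(n):
            if r != c:
                f = m[r][c]
                m[r] = [x - f * y for x, y in zip(m[r], m[c])]
    return [row[n:] for row in m]

def pseudo_inverse(L, J):
    """Q = (L + J)^{-1} - J, which equals L^+ by Proposition 8."""
    n = len(L)
    shifted = inverse([[L[i][j] + J[i][j] for j in range(n)] for i in range(n)])
    return [[shifted[i][j] - J[i][j] for j in range(n)] for i in range(n)]

half, third = Fraction(1, 2), Fraction(1, 3)

# Before: single edge (1,2); components {1,2} and {3}.
L_before = [[1, -1, 0], [-1, 1, 0], [0, 0, 0]]
J_before = [[half, half, 0], [half, half, 0], [0, 0, 1]]

# After: edges (1,2) and (1,3); the graph is connected.
L_after = [[2, -1, -1], [-1, 1, 0], [-1, 0, 1]]
J_after = [[third] * 3 for _ in range(3)]

before = pseudo_inverse(L_before, J_before)
after = pseudo_inverse(L_after, J_after)
delta = [[after[i][j] - before[i][j] for j in range(3)] for i in range(3)]
```

With this reading, `delta[0][2]`, `delta[0][1]`, `delta[1][2]`, and `delta[1][1]` correspond to $\Delta p_{13}$, $\Delta p_{12}$, $\Delta p_{23}$, and $\Delta p_{22}$ and match the values quoted in the text.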